Abstract



We have previously argued that the syntactic structure of natural lan- 
guage can be exploited to construct powerful polynomial time inference 
procedures. This paper supports the earlier arguments by demonstrat- 
ing that a natural language based polynomial time procedure can solve 
Schubert's steamroller in a single step. 
1 Introduction

Schubert's steamroller is a well known challenge problem for automated reasoning 
systems [Cohn, 1989], [Cohn, 1984], [Stickel, 1985], [Bibel et al., 1987], [Davies,
1988], [Oppacher and Suen, 1986], and [Walther, 1984]. All previous automated
solutions of Schubert's steamroller have been based on search procedures — proce- 
dures that search for proofs and which are not guaranteed to halt. In this paper we 
describe a different kind of solution to Schubert's steamroller — a solution without 
search. Although there is no established technical meaning to the term "search",
it seems reasonable to assert that a procedure guaranteed to terminate in poly- 
nomial time does not search. We present a natural general purpose polynomial 
time inference procedure capable of solving Schubert's steamroller given only the 
axioms of the problem plus three user specified "focus terms". 

The polynomial time inference procedure presented in this paper exploits, in 
an apparently essential way, aspects of natural language syntax. The inference 
procedure is defined by a set of inference rules. If R is a set of inference rules then
we let ⊢_R be the inference relation generated by R, i.e., for any set of formulas
Σ and formula Φ we write Σ ⊢_R Φ if there exists a derivation of Φ from the
formulas in Σ using the inference rules in R. A rule set R will be called tractable if
⊢_R is polynomial time decidable, i.e., there exists a procedure that is guaranteed
to terminate in polynomial time in the written length of Σ and Φ and that
determines whether or not Σ ⊢_R Φ. There exist useful, though incomplete, tractable
sets of inference rules for first order logic. However, much more powerful tractable
rule sets can be given if first order formulas are written in a non-standard
syntax. A "taxonomic syntax" for first order logic is presented in [McAllester et al.,
1989]. A more elaborate "Montagovian syntax" for first order logic, incorporating
quantificational aspects of English noun phrases, is presented in [McAllester and 
Givan, 1989]. The polynomial time inference procedure used here to solve Schu- 
bert's steamroller is defined by a set of inference rules stated in a Montagovian 
syntax. The inference relation defined by these inference rules appears not to be 
definable in the classical syntax of first order logic. 

2 Montagovian Syntax 

In this section we present a Montagovian syntax for first order logic similar to 
that described in [McAllester and Givan, 1989]. The classical syntax for first order 
logic involves two grammatical categories — formulas and terms. The Montago- 
vian syntax presented below also involves two syntactic categories — formulas and 
class expressions. Formulas denote truth values and class expressions denote sets. 
Constant symbols and variables are treated as class expressions that denote
singleton sets. In the following, Φ is a formula; C, C_1, C_2 are class expressions; and
R is a binary relation symbol.

• A class expression is one of the following: 

- A class symbol (monadic predicate symbol). 

- A constant symbol or variable. 

- An intersection expression (intersection C_1 C_2).

- A union expression (union C_1 C_2).

- A λ-expression (λ x Φ(x)), where x is a variable.

- An expression (R (some C)) or (R (every C)).

• A formula is one of the following:

- A subset formula of the form (every C_1 C_2).

- An intersection formula of the form (some C_1 C_2).

- An existence formula of the form (there-exists-a C).

- An at-most-one formula of the form (at-most-one C).

- Any Boolean combination of the above formulas. 

Before giving a formal semantics, it is useful to consider some examples of 
formulas and their associated meanings. If P and Q are class symbols then (every 
P Q) is a formula which is true if the set denoted by P is a subset of the set denoted 
by Q. If man is a class symbol that denotes the set of all men, and runs is a class 
symbol that denotes the set of all things that run, then the formula (every man 
runs) is true if every man runs. The formula (some man runs) is true if some 
man runs. 

If John is a constant symbol (or variable) then the formulas (every John 
runs) and (some John runs) are semantically equivalent and we can use (John 
runs) as an abbreviation for either formula. Similarly, we write (likes John) 
as an abbreviation for either of the class expressions (likes (every John)) or 
(likes (some John)). 

If owns is a relation symbol, and denotes the predicate which is true of two 
objects if the first owns the second, then the class expression (owns (some car)) 
denotes the set of individuals that own some car. If policeman is a class symbol 
that denotes the set of all policemen, then the formula (every policeman (owns 
(some car))) is true if every policeman owns a car. 



Our formal semantics for the Montagovian syntax is a (drastic) simplification 
of Montague's original semantics for English [Montague, 1973].¹ Just as in classical
syntax, a model of our Montagovian language is a first order model, i.e., a domain 
D together with an interpretation of constant, class, and relation symbols. Each 
first order model interprets each constant symbol as an element of its domain. A 
model also interprets each class symbol as a subset of its domain and each relation 
symbol as a binary relation on its domain, i.e., a set of pairs of domain elements. 

If M is a first order model, and p is a variable interpretation over M, i.e., a
mapping from variables to elements of the domain of M, then we write V(e, M, p)
for the semantic value of the expression e in the model M under variable
interpretation p. If C is a class expression then V(C, M, p) is a subset of the domain of
M. If Φ is a formula, then V(Φ, M, p) is a truth value, either T or F.

• For class symbol P, V(P, M, p) is the set M(P).

• For constant c, V(c, M, p) is the singleton set {M(c)}.

• For variable x, V(x, M, p) is the singleton set {p(x)}.

• V((intersection C_1 C_2), M, p) is the set V(C_1, M, p) ∩ V(C_2, M, p).

• V((union C_1 C_2), M, p) is the set V(C_1, M, p) ∪ V(C_2, M, p).

• V((λ x Φ(x)), M, p) is the set of all d such that V(Φ(x), M, p[x := d]) is T, where
p[x := d] is the same as p except that it interprets x as d.

• V((R (every C)), M, p) is the set of all d such that for every d' in V(C, M, p) the pair
<d, d'> is an element of the relation denoted by R. (Consider the class expression (loves
(every child)).)

• V((R (some C)), M, p) is the set of all d such that there exists an element d' in V(C, M, p)
such that the pair <d, d'> is an element of the relation denoted by R. (Consider the
class expression (loves (some child)).)

• V((every C W), M, p) is T if the set V(C, M, p) is a subset of V(W, M, p).

• V((some C W), M, p) is T if the set V(C, M, p) ∩ V(W, M, p) is non-empty.

• V((there-exists-a C), M, p) is T if V(C, M, p) is non-empty.

• V((at-most-one C), M, p) is T if V(C, M, p) has at most one member.

• Boolean combinations of atomic formulas have their standard meaning. 
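
The semantic clauses above can be read as a recursive evaluator over a finite model. The following Python sketch is only an illustration under stated assumptions: expressions are encoded as nested tuples (for example `("owns", ("some", "car"))`), a model is a dictionary with the hypothetical keys `domain`, `consts`, `classes`, and `rels`, and the abbreviations such as (John runs) are not expanded. None of these names come from the paper.

```python
def V(e, M, p):
    """Semantic value of expression e in model M under variable interpretation p."""
    if isinstance(e, str):
        if e in p:                       # variable: the singleton {p(x)}
            return {p[e]}
        if e in M["consts"]:             # constant: the singleton {M(c)}
            return {M["consts"][e]}
        return M["classes"][e]           # class symbol: the set M(P)
    head = e[0]
    if head == "intersection":
        return V(e[1], M, p) & V(e[2], M, p)
    if head == "union":
        return V(e[1], M, p) | V(e[2], M, p)
    if head == "lambda":                 # ("lambda", x, formula)
        _, x, phi = e
        return {d for d in M["domain"] if V(phi, M, {**p, x: d})}
    if head in M["rels"]:                # (R, ("some", C)) or (R, ("every", C))
        quantifier, C = e[1]
        targets, rel = V(C, M, p), M["rels"][head]
        test = all if quantifier == "every" else any
        return {d for d in M["domain"] if test((d, t) in rel for t in targets)}
    if head == "every":                  # subset formula
        return V(e[1], M, p) <= V(e[2], M, p)
    if head == "some":                   # intersection formula
        return bool(V(e[1], M, p) & V(e[2], M, p))
    if head == "there-exists-a":
        return len(V(e[1], M, p)) >= 1
    if head == "at-most-one":
        return len(V(e[1], M, p)) <= 1
    if head == "not":
        return not V(e[1], M, p)
    if head == "and":
        return V(e[1], M, p) and V(e[2], M, p)
    if head == "or":
        return V(e[1], M, p) or V(e[2], M, p)
    raise ValueError(f"unrecognised expression: {e!r}")


# A small hypothetical model: two policemen, one car, and one ownership fact.
M = {
    "domain": {"al", "bo", "car1"},
    "consts": {"Al": "al"},
    "classes": {"policeman": {"al", "bo"}, "car": {"car1"}},
    "rels": {"owns": {("al", "car1")}},
}
owns_some_car = ("owns", ("some", "car"))
print(V(("every", "policeman", owns_some_car), M, {}))  # False
print(V(("some", "policeman", owns_some_car), M, {}))   # True
```

In this model only one of the two policemen owns the car, so the `every` formula is false while the `some` formula is true.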

Binary relation symbols, in the presence of equality, are in some sense sufficient
to express arbitrary first order facts. We leave it to the reader to verify that, if
we restrict our attention to languages with only constants and unary and binary
relation symbols, then every classical first order formula can be translated to a
logically equivalent formula of Montagovian syntax and vice versa. Montagovian
syntax is really just a syntactic variant of first order logic.

¹ Our class expressions play the role of both verb phrases, as in (owns (some car)) and of
incomplete noun phrases, as in (brother-of (some policeman)). Montague, of course, treated
these as separate syntactic categories. Montague also treated complete noun phrases, such as
(every policeman), as another syntactic category with its own denotational semantics. The
treatment of propositional attitudes makes Montague's formal language yet more complex.



3 Schubert's Steamroller 

Schubert's steamroller is a logical puzzle originally stated in English. Each sentence 
of the English statement of the problem is given below along with a translation of 
that sentence into a set of formulas in our Montagovian syntax for first order logic. 

Wolves, foxes, birds, caterpillars, and snails are animals, and there are some of 
each of them. 

(every wolf animal) (there-exists-a wolf) 
(every fox animal) (there-exists-a fox) 

There are some grains, and grains are plants. 

(there-exists-a grain) (every grain plant) 

Caterpillars and snails are much smaller than birds, which are much smaller than 
foxes, which are much smaller than wolves. 

(every caterpillar (is-smaller-than (every bird))) 
(every snail (is-smaller-than (every bird))) 
(every bird (is-smaller-than (every fox))) 
(every fox (is-smaller-than (every wolf))) 

Wolves do not like to eat foxes or grains, while birds like to eat caterpillars but 
not snails. 

¬(some wolf (eats (some fox))) ¬(some wolf (eats (some grain)))

(every bird (eats (every caterpillar))) ¬(some bird (eats (some snail)))

Caterpillars and snails like to eat some plants. 

(every caterpillar (eats (some plant))) 
(every snail (eats (some plant))) 

Every animal either likes to eat all plants or all animals much smaller than itself 
that like to eat some plants. 



(every animal
  (union
    (eats (every plant))
    (λ x (x (eats (every
                (intersection
                  animal
                  (is-smaller-than x))))))))

Prove there is an animal that likes to eat a grain-eating animal. 

(some animal 

(eats (some (intersection 
animal 
(eats (some grain)))))) 

A formula of Montagovian syntax is called quantifier-free if it does not contain
any λ-expressions. The "quantifiers" some and every that appear in noun phrases
are considered to be quantifier-free combinators. In [McAllester and Givan, 1989]
we show that satisfiability is decidable (NP-complete) for the quantifier-free
fragment of the Montagovian syntax presented in that paper. Although the
Montagovian syntax presented here is somewhat more elaborate, we conjecture that
the quantifier-free fragment remains decidable. When translated into our
Montagovian syntax, all of the sentences of Schubert's steamroller are quantifier-free
except for the second to last sentence above, which involves a single λ-expression.
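
The quantifier-free test described here amounts to a search for λ-expressions in the syntax tree. The sketch below assumes a nested-tuple encoding of formulas in which a λ-expression is written `("lambda", x, body)`; the encoding and the function name `is_quantifier_free` are illustrative assumptions, not part of the paper.

```python
def is_quantifier_free(e):
    """True if the expression contains no lambda-expression anywhere inside it."""
    if isinstance(e, str):          # class symbol, constant, variable, or keyword
        return True
    if e[0] == "lambda":
        return False
    return all(is_quantifier_free(sub) for sub in e[1:])


# "Foxes are much smaller than wolves": no lambda-expression.
fox_smaller = ("every", "fox", ("is-smaller-than", ("every", "wolf")))

# The second to last steamroller sentence, transcribed as given in section 3.
diet = ("every", "animal",
        ("union",
         ("eats", ("every", "plant")),
         ("lambda", "x",
          ("x", ("eats", ("every",
                          ("intersection", "animal",
                           ("is-smaller-than", "x"))))))))

print(is_quantifier_free(fox_smaller))  # True
print(is_quantifier_free(diet))         # False
```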



4 Polynomial Time Inference 

Figure 1 gives a set of 32 inference rules stated in our Montagovian syntax. We
are actually interested in the rules in figure 1 plus all contrapositives of those rules.
Each inference rule is analogous to an implication of the form Ψ_1 ∧ ⋯ ∧ Ψ_n → Φ
where each Ψ_i is an antecedent and Φ is the conclusion. A contrapositive of a rule
Ψ_1 ∧ ⋯ ∧ Ψ_n → Φ is a rule of the form

Ψ_1 ∧ ⋯ ∧ Ψ_{i-1} ∧ ¬Φ ∧ Ψ_{i+1} ∧ ⋯ ∧ Ψ_n → ¬Ψ_i.

In the contrapositive, the conclusion has been interchanged with one of the
antecedents and both of the interchanged formulas have been negated. If a given
rule is semantically sound, then so is each of its contrapositives. We conjecture
that the rule set consisting of the rules in figure 1 plus all contrapositives of those



(1)-(6) [Six propositional rules involving negation and disjunction; their layout did not survive extraction.]

(7) (every C C)
(8) (there-exists-a c)
(9) (at-most-one c)
(10) (there-exists-a C) ⟹ (some C C)
(11) (some C W) ⟹ (there-exists-a C)
(12) (some C W) ⟹ (some W C)
(13) (every C W), (every W Z) ⟹ (every C Z)
(14) (some C W), (every C Z) ⟹ (some Z W)
(15) (some C W), (at-most-one C) ⟹ (every C W)
(16) (at-most-one W), (every C W) ⟹ (at-most-one C)
(17) ¬(at-most-one C) ⟹ (there-exists-a C)
(18) ¬(every C W) ⟹ (there-exists-a C)
(19) (every C (union C W))
(20) (every W (union C W))
(21) (every (intersection C W) C)
(22) (every (intersection C W) W)
(23) (every C Z), (every W Z) ⟹ (every (union C W) Z)
(24) (every Z C), (every Z W) ⟹ (every Z (intersection C W))
(25) (some C W) ⟹ (there-exists-a (intersection C W))
(26) ¬(there-exists-a C) ⟹ (every W (R (every C)))
(27) (every C (union W Z)), ¬(some C W) ⟹ (every C Z)
(28) (every C (union W Z)), ¬(some C Z) ⟹ (every C W)
(29) (every C W) ⟹ (every (R (some C)) (R (some W)))
(30) (every C W) ⟹ (every (R (every W)) (R (every C)))
(31) (some C W) ⟹ (every (R (every C)) (R (some W)))
(32) (there-exists-a (R (some C))) ⟹ (there-exists-a C)

Figure 1: Some inference rules for Montagovian Syntax. Each rule is written as
antecedents ⟹ conclusion; rules (7)-(9) and (19)-(22) have no antecedents. The
letters C, W, and Z range over class expressions, c ranges over constants and
variables, Φ and Ψ range over formulas, and R ranges over relation symbols.



rules is local (see below), and thus generates a polynomial time decidable inference 
relation. 
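
The contrapositive construction described in this section is mechanical, and a short Python sketch may make it concrete. The encoding is an assumption made for illustration: a rule is a pair of a tuple of antecedents and a conclusion, formulas are nested tuples, and negation is written `("not", formula)`. The sketch negates formulas literally and does not simplify double negations, a detail the text leaves open.

```python
def contrapositives(antecedents, conclusion):
    """Return one contrapositive per antecedent of the rule antecedents -> conclusion.

    The i-th contrapositive swaps the i-th antecedent with the conclusion
    and negates both of the swapped formulas.
    """
    result = []
    for i, psi in enumerate(antecedents):
        new_antecedents = (antecedents[:i]
                           + (("not", conclusion),)
                           + antecedents[i + 1:])
        result.append((new_antecedents, ("not", psi)))
    return result


# Rule (13) of figure 1: (every C W), (every W Z) => (every C Z)
rule_13 = ((("every", "C", "W"), ("every", "W", "Z")), ("every", "C", "Z"))

for ante, concl in contrapositives(*rule_13):
    print(ante, "=>", concl)
```

A rule with n antecedents yields n contrapositives, and a rule with no antecedents yields none, so adding contrapositives enlarges the rule set by at most the total number of antecedents.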

The inference rules in figure 1, together with their contrapositives, determine 
a sound inference relation for formulas expressed in our Montagovian syntax for 
first order logic. This (incomplete) first order inference relation appears not to have
any definition in the classical syntax for first order logic.² We have constructed a
polynomial time inference procedure based on this set of inference rules. A general 
theoretical framework for constructing polynomial time inference procedures is 
presented in [McAllester, 1990]. Let R be any set of inference rules. The following 
definition is from [McAllester, 1990]. 

Definition: We write Σ ⊩_R Φ if there exists a proof of Φ from the
premise set Σ such that every proper subexpression of a formula used
in the proof appears as a proper subexpression of Φ, a proper
subexpression of some formula in Σ, or as a closed (variable free) expression
in the rule set R.

The following lemma is proved in [McAllester, 1990].

Lemma: For any given rule set R, there exists a procedure for
determining whether or not Σ ⊩_R Φ which runs in time polynomial in the
written length of Σ and Φ.

The inference relation ⊩_R is a restricted version of ⊢_R. For any rule set R,
the relation ⊩_R is polynomial time decidable. If the relation ⊢_R is intractable,
as is the case for any sound and complete set of rules for first order logic, then
the polynomial time relation ⊩_R will be weaker than the relation ⊢_R. However,
there is a large class of rule sets for which these two relations are the same. The
following definition is also from [McAllester, 1990].

Definition: A set R of inference rules is called local if the relation ⊩_R
is the same as the relation ⊢_R.

An immediate consequence of the above definitions and lemma is that local rule 
sets are tractable, i.e., they generate polynomial time decidable inference relations. 
A variety of nontrivial local rule sets is presented in [McAllester, 1990]. Let M 
be the set of inference rules in figure 1 together with the contrapositives of those 
rules. We conjecture, although we have not yet proved, that M is local. Even if
M is not local, ⊩_M is still polynomial time decidable, and it appears to be a very
powerful inference relation.
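
To illustrate why the restricted relation is polynomial time decidable, the following Python sketch runs forward chaining in which rules are only instantiated with class expressions that already occur in the premises or the goal. It is a toy under stated assumptions, not the procedure of [McAllester, 1990]: it covers only rules (7), (13), and (29) of figure 1, handles only atomic `every` and `some` formulas encoded as nested tuples, and uses the hypothetical names `class_exprs` and `locally_derivable`.

```python
def class_exprs(formula):
    """Class expressions occurring as proper subexpressions of an atomic formula."""
    found = set()

    def walk(c):
        found.add(c)
        if isinstance(c, tuple):
            if c[0] in ("intersection", "union"):
                walk(c[1])
                walk(c[2])
            else:                      # relation application (R, (quantifier, C))
                walk(c[1][1])

    for c in formula[1:]:
        walk(c)
    return found


def locally_derivable(premises, goal):
    """Decide derivability using only expressions from the premises and the goal."""
    universe = set()
    for f in list(premises) + [goal]:
        universe |= class_exprs(f)
    facts = set(premises) | {("every", c, c) for c in universe}       # rule (7)
    some_apps = [c for c in universe
                 if isinstance(c, tuple) and len(c) == 2 and c[1][0] == "some"]
    while True:
        every = [f for f in facts if f[0] == "every"]
        new = set()
        for _, a, b in every:
            for _, b2, c in every:
                if b == b2:
                    new.add(("every", a, c))                          # rule (13)
            for app in some_apps:
                image = (app[0], ("some", b))
                if app[1][1] == a and image in universe:
                    new.add(("every", app, image))                    # rule (29)
        if new <= facts:               # fixpoint reached
            return goal in facts
        facts |= new


premises = [("every", "car", "vehicle"),
            ("every", "policeman", ("owns", ("some", "car")))]
goal = ("every", "policeman", ("owns", ("some", "vehicle")))
print(locally_derivable(premises, goal))                           # True
print(locally_derivable(premises, ("every", "vehicle", "car")))    # False
```

The number of class expressions in the universe is linear in the written length of the input, so at most quadratically many `every` facts can ever be added, and the loop reaches a fixpoint after polynomially many steps.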

² This is because the variables in the rules of figure 1 range over class expressions, but there
are no class expressions in classical syntax. Consider for example the classical equivalent of the 
Montagovian class expression (brother-of (every man)). 



5 Socratic Proof Systems 

Local rule sets define polynomial time inference procedures. Of course, no poly- 
nomial time inference procedure can be complete for first order logic — the 32 
inference rules given in the previous section are not complete for our Montagovian 
syntax for first order logic. However, it is possible to exploit fast and powerful infer- 
ence procedures based on Montagovian syntax in constructing semi-automated ver- 
ification systems. In this section we describe a particular kind of semi-automated 
verification system called a Socratic sequent system.³ A proof in a Socratic sequent
system is a series of lines where each line is a sequent of the form Σ ⊢ Φ where Σ
is a set of formulas and Φ is a formula.

Definition: A Socratic sequent system is a pair <R, S> where R is a 
set of inference rules (deriving formulas from formulas) and S is a set 
of sequent rules (deriving sequents from sequents). 

Definition: An acceptable derivation in a Socratic sequent system
<R, S> is a series of sequents where, for each sequent Σ ⊢ Φ, either
Σ ⊢_R Φ (in which case the sequent is called obvious), or the sequent
follows from earlier sequents using a rule in S.

If the rule set R that defines the obvious sequents is local, then the inference
relation ⊢_R is polynomial time decidable, and one can therefore determine, in
polynomial time, whether a series of sequents is an acceptable derivation in the
sequent system <R, S>. (Note that finding an acceptable derivation of Φ from
Σ is still an undecidable operation; the critical point is that once we have such a
derivation, we can verify that it is acceptable in polynomial time.)
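
Checking a derivation for acceptability can be sketched as a single pass over its lines. In the Python sketch below everything is an illustrative assumption: a sequent is a pair of a frozenset of premises and a goal, `toy_obvious` is a deliberately weak stand-in for a polynomial time obviousness test, and `rule_s1` implements a case-analysis sequent rule, which is how rule S1 of figure 2 appears to read.

```python
def acceptable(derivation, obvious, sequent_rules):
    """Each line must be obvious or follow from earlier lines by a sequent rule."""
    for i, sequent in enumerate(derivation):
        earlier = derivation[:i]
        if obvious(*sequent):
            continue
        if any(rule(sequent, earlier) for rule in sequent_rules):
            continue
        return False
    return True


def toy_obvious(sigma, phi):
    """Toy oracle: the goal is a premise, or a disjunction with a premise as a disjunct."""
    if phi in sigma:
        return True
    return isinstance(phi, tuple) and phi[0] == "or" and (phi[1] in sigma or phi[2] in sigma)


def rule_s1(sequent, earlier):
    """Case analysis: from sigma+{psi} |- phi and sigma+{not psi} |- phi infer sigma |- phi."""
    sigma, phi = sequent
    earlier_set = set(earlier)
    for premises, goal in earlier:
        extra = premises - sigma
        if goal == phi and sigma <= premises and len(extra) == 1:
            (psi,) = extra
            if (sigma | {("not", psi)}, phi) in earlier_set:
                return True
    return False


excluded_middle = ("or", "p", ("not", "p"))
line1 = (frozenset({"p"}), excluded_middle)            # obvious: first disjunct is a premise
line2 = (frozenset({("not", "p")}), excluded_middle)   # obvious: second disjunct is a premise
line3 = (frozenset(), excluded_middle)                 # follows by case analysis on p

print(acceptable([line1, line2, line3], toy_obvious, [rule_s1]))  # True
print(acceptable([line1, line3], toy_obvious, [rule_s1]))         # False
```

The checker calls the obviousness test once per line and each sequent rule at most once per line, so its running time is polynomial whenever the obviousness test is.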

In this section we give a Socratic sequent system that is complete for our 
Montagovian syntax for first order logic and show how this Socratic system yields 
a one-step solution to Schubert's steamroller. The sequent rules for our proof 
system are given in figure 2. The rules of obviousness of our Socratic system 
include all of the inference rules in figure 1 plus the following two rules concerning 
λ-expressions:

(33) Φ(y), (focus-on y) ⟹ (every y (λ x Φ(x)))



³ The term "Socratic proof" was introduced in [Crawford and Kuipers, 1989] to describe any
system in which steps of a proof are verified using an automated reasoning procedure. Our notion 
of a Socratic sequent system is a special case of this general concept. 



(S1) Σ ∪ {Ψ} ⊢ Φ;  Σ ∪ {¬Ψ} ⊢ Φ  ⟹  Σ ⊢ Φ
(S2) Σ ⊢ Φ  ⟹  Σ ∪ {Ψ} ⊢ Φ
(S3) Σ ∪ {(focus-on x)} ⊢ Φ  ⟹  Σ ⊢ Φ
(S4) Σ ⊢ Ψ;  Σ ∪ {Ψ} ⊢ Φ  ⟹  Σ ⊢ Φ
(S5) Σ ⊢ (there-exists-a C);  Σ ∪ {(every x C)} ⊢ Φ  ⟹  Σ ⊢ Φ
(S6) Σ ⊢ (some C W);  Σ ∪ {(every x C), (every x W)} ⊢ Φ  ⟹  Σ ⊢ Φ
(S7) Σ ∪ {(every x C)} ⊢ (every W (R x))  ⟹  Σ ⊢ (every W (R (every C)))
(S8) Σ ∪ {(every x C)} ⊢ (every x W)  ⟹  Σ ⊢ (every C W)
(S9) Σ ⊢ (at-most-one Z);  Σ ⊢ (every Z (R (some C)));
     Σ ∪ {(every x C), (every Z (R x))} ⊢ Φ  ⟹  Σ ⊢ Φ
(S10) Σ ∪ {(every x_1 C), (every x_2 C)} ⊢ (every x_1 x_2)  ⟹  Σ ⊢ (at-most-one C)

Figure 2: The Socratic Proof Rules. Each rule is written as premise sequents ⟹
conclusion sequent. In these rules C, W, and Z are class expressions, Φ and Ψ are
formulas, and x, x_1 and x_2 are variables that do not appear free in Σ, Φ, C, Z, or W.

(34) (every y (λ x Ψ(x))), (focus-on y) ⟹ Ψ(y)

Each of these rules has an antecedent of the form (focus-on y), where y must
be a variable. Formulas of this form are used to control the inference process and
have no semantic content. The focus-on antecedents of the above rules restrict
the application of these rules to "focus variables", i.e., variables y such that the
formula (focus-on y) is given as a premise (there are no inference rules for deriving
formulas of the form (focus-on y)). Note that the sequent rule S3 in figure 2 can
be used to eliminate focus-on premises from sequents. If the λ-expression rules
were not restricted with focus-on antecedents, then the inference relation defined
by those rules, together with the rules of figure 1, would be undecidable. Let M'
be the set of inference rules including all rules in figure 1 and their contrapositives,
plus the above two rules for quantifiers. We have a polynomial time implementation
of the inference relation ⊩_M', provided there is a bounded level of λ-nesting. This
implementation is constructed along the lines described in [McAllester, 1989]. We
conjecture that M' is local, and thus that ⊩_M' is the same as ⊢_M'.

Now let Σ be the set of formulas of Montagovian syntax used to represent the
premises of Schubert's steamroller as given in section 3 and let Φ be the formula
to be proven. Our implementation of an inference procedure for the rule set M'
has been used to verify that:

Σ ∪ { (x_w wolf), (focus-on x_w),
      (x_f fox), (focus-on x_f),
      (x_b bird), (focus-on x_b) } ⊢_M' Φ


This sequent expresses the English statement "to see that Φ follows from Σ,
consider a wolf x_w, a fox x_f, and a bird x_b; the result is then obvious". Repeated
use of the Socratic inference rules S3 and S5 can be used to eliminate all premises
other than Σ, and hence derive the sequent Σ ⊢ Φ. A simple user interface to the
Socratic proof system can be used to automatically apply sequent rules, such as 
S3 and S5, that remove extraneous premises. Given this user interface, the above 
sequent is a one line solution to Schubert's steamroller. 



6 Discussion 

We have constructed a complete proof system for a non-standard syntax for first 
order logic. This proof system has the simultaneous features that proofs are short 
and yet, if our conjectures are correct, the acceptability of a proof is quickly ver- 
ifiable. The proofs in our system are so short that Schubert's steamroller can be 
proved in a single line, by far the shortest known proof in a proof system with 
polynomial time checkable proofs. 

The conciseness of the proofs in our proof system appears to be due to the
power (and conjectured locality) of the inference rules given in figure 1. This
power appears to depend fundamentally on the use of a non-standard syntax to
express the inference rules. Just what aspect of the new syntax makes this added
expression possible is unclear, but one relevant observation is that the quantifier
free fragment of the new syntax can express many facts which require quantifiers in
classical syntax (e.g. (every man mammal)). Our experience indicates that the
decision procedure for the inference relation ⊩_M immediately solves the vast majority
of inference problems that can be stated in the fragment of Montagovian syntax
that does not contain λ-quantifiers.⁴ The statement of Schubert's steamroller in
Montagovian syntax contains only a single λ-quantifier, a quantifier needed to
represent the English anaphora "itself". Three instantiations of this quantifier are
needed in the solution of Schubert's steamroller. Our one-line solution specifies
the objects on which the quantifier is to be instantiated: the focus-on premises
in the one-line solution control the use of the instantiation rules 33 and 34.

⁴ We conjecture that validity in the λ-free fragment of our Montagovian syntax is decidable
although it is known that the inference rules in figure 1 are not complete for λ-free Montagovian
formulas.

The inference relation defined by the inference rules in figure 1 appears not to
have any definition in the classical syntax of first order logic. Thus, Montagovian
syntax appears to play an essential role in the specification of the inference relation
and therefore in the construction of a Socratic sequent system with extremely
concise proofs. Although this suggests that natural language syntax plays an 
important role in human reasoning, it seems sufficient to merely claim that aspects 
of natural language syntax can be used to build powerful inference algorithms. 